From a Computational Linguistic Atlas to Dialectal Lexical Resources
Abstract
Computers can help dialectologists make full use of the information they have acquired: the basic dimensions of dialectal research can be enlarged and its possible outcomes can become more sophisticated. In this paper, we show how a dialectal database, DBT-ALT, containing the data collected for the Atlante Lessicale Toscano (‘Lexical Atlas of Tuscany’), can be used as the starting point for producing dialectal dictionaries and other kinds of lexicographic resources, provided that adequate computational tools are available. First, the architecture and functioning of DBT-ALT are described in detail. Second, we show how the DBT-ALT access functionalities can be exploited to extract subsets of data, which can then be converted into independent lexicographic resources by means of a Lexicographic Workstation.
Related resources
Dialectal resources on-line: the ALT-Web experience
The paper presents an on-line dialectal resource, ALT-Web, which gives access to the linguistic data of the Atlante Lessicale Toscano, a specially designed linguistic atlas in which lexical data have both a diatopic and diastratic characterisation. The paper focuses on: the dialectal data representation model; the access modalities to the ALT dialectal corpus; ontology-based search.
Patterns of language variation and underlying linguistic features: a new dialectometric approach
For almost forty years quantitative methods have been applied to the analysis of dialect variation: these methods focused mostly on identifying the most important dialectal groups using an aggregate analysis of the linguistic data (Séguy 1973; Goebl 1984; Nerbonne et al. 1999). While viewing dialect differences at an aggregate level certainly gives a more comprehensive view than the analysis of...
Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
We introduce an electronic three-way lexicon, Tharwa, comprising Dialectal Arabic, Modern Standard Arabic and English correspondents. The paper focuses on Egyptian Arabic as the first pilot dialect for the resource, with plans to expand to other dialects of Arabic in later phases of the project. We describe Tharwa’s creation process and report on its current status. The lexical entries are augm...
Dialectal Atlas of the Arab World - between Intention and Reality
Arabic dialectology has a long history and achieved significant progress in collecting and analyzing linguistic data and its classification. The present paper analyses modern trends in the linguistic situation in the Arab world and defines the topics essential for the Arabic dialectology, which require an urgent solution. During the last century, several attempts have been undertaken to create ...
Developing and Using a Pilot Dialectal Arabic Treebank
In this paper, we describe the methodological procedures and issues that emerged from the development of a pilot Levantine Arabic Treebank (LATB) at the Linguistic Data Consortium (LDC) and its use at the Johns Hopkins University (JHU) Center for Language and Speech Processing workshop on Parsing Arabic Dialects (PAD). This pilot, consisting of morphological and syntactic annotation of approxim...